Search Results/Filters    

Filters

Year

Banks




Expert Group











Full-Text


Issue Info: 
  • Year: 

    2021
  • Volume: 

    8
  • Issue: 

    2
  • Pages: 

    1-18
Measures: 
  • Citations: 

    0
  • Views: 

    41
  • Downloads: 

    4
Abstract: 

In recent years, Question Answering systems have become more popular and widely used by users. Despite the increasing popularity of these systems, their performance is not even sufficient for textual data and requires further research. These systems consist of several parts that one of them is the Answer Selection component. This component detects the most relevant answer from a list of candidate answers. The methods presented in previous researches have attempted to provide an independent Model to undertake the answer-selection task. An independent Model cannot comprehend the syntactic and semantic features of questions and answers with a small training dataset. To fill this gap, language Models can be employed in implementing the answer selection part. This action enables the Model to have a better understanding of the language in order to understand questions and answers better than previous works. In this research, we will present the 'BAS' stands for BERT Answer Selection that uses the BERT language Model to comprehend language. The empirical results of applying the Model on the TrecQA Raw, TrecQA Clean, and WikiQA datasets demonstrate that using a robust language Model such as BERT can enhance the performance. Using a more robust classifier also enhances the effect of the language Model on the answer selection component. The results demonstrate that language comprehension is an essential requirement in natural language processing tasks such as answer selection.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 41

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 4 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2021
  • Volume: 

    7
Measures: 
  • Views: 

    186
  • Downloads: 

    0
Abstract: 

Intent classification is one of the important tasks in natural language understanding which aims to classify queries based on their intent, goal, or purpose which is implied in the content. However, the problem in this task is the lack of data labeled by a human agent. This problem leads to weakness in the generalization of Models, especially when the Models face rare words. Using pre-trained Models can be useful in the generalization of language representation. BERT pre-trained language Model which has been currently published has made a major impact in the natural language processing field. This Model which is trained on unlabeled large-scale corpora, with fine-tuning could achieve state-of-art results in various natural language processing tasks e. g. question answering and sentiment analysis. In this paper, we compared the BERT Model with conventional machine learning Models and shown that the BERT Model has a better performance than conventional machine learning Models in few-shot learning.

Yearly Impact:   مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 186

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0
Issue Info: 
  • Year: 

    2024
  • Volume: 

    22
  • Issue: 

    1
  • Pages: 

    21-29
Measures: 
  • Citations: 

    0
  • Views: 

    90
  • Downloads: 

    14
Abstract: 

In today's information age, efficient document ranking plays a crucial role in information retrieval systems. This article proposes a new approach to document ranking using embedding Models, with a focus on the BERT language Model to improve ranking results. The proposed approach uses vocabulary embedding methods to represent the semantic representations of user queries and document content. By converting textual data into semantic vectors, the relationships and similarities between queries and documents are evaluated under the proposed ranking relationships with lower cost. The proposed ranking relationships consider various factors to improve accuracy, including vocabulary embedding vectors, keyword location, and the impact of valuable words on ranking based on semantic vectors. Comparative experiments and analyses were conducted to evaluate the effectiveness of the proposed relationships. The empirical results demonstrate the effectiveness of the proposed approach in achieving higher accuracy compared to common ranking methods. These results indicate that the use of embedding Models and their combination in proposed ranking relationships significantly improves ranking accuracy up to 0. 87 in the best case. This study helps improve document ranking and demonstrates the potential of the BERT embedding Model in improving ranking performance.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 90

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 14 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Journal: 

Karafan

Issue Info: 
  • Year: 

    2023
  • Volume: 

    20
  • Issue: 

    1
  • Pages: 

    341-362
Measures: 
  • Citations: 

    0
  • Views: 

    113
  • Downloads: 

    36
Abstract: 

Seljuk Pretrained language Models are very important because of their application in issues related to natural language processing. Language Models such as BERT have become more popular among researchers. Due to the focus of these language Models on English, other languages ​​are limited to some multilingual Models. In this article, the PersianSportBERT language Model is presented for the purpose of Persian sports analysis in topics related to this linguistic field. This language Model is based on the BERT language Model and was trained using the collected dataset. Three problems were used to evaluate the new language Model: sentiment analysis, named entity recognition and text infilling. In order to train this language Model, due to the lack of a suitable dataset, a wide range of sports events and news in the Persian language was prepared from several online sources. Due to the specialization of this Model and compared to the language Models presented for the Persian language, this Model provided better results in all three problems. This Model had the best performance with 71. 7% and 95. 2% in text infilling and named entity recognition, respectively. In sentiment analysis, the sports Model presented better results. These findings demonstrate that using a language Model related to any specialized field will have better results compared to language Models related to the general field of texts.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 113

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 36 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Author(s): 

DAS A. | GANTAIT S. | MANDAL N.

Issue Info: 
  • Year: 

    2011
  • Volume: 

    6
  • Issue: 

    -
  • Pages: 

    40-48
Measures: 
  • Citations: 

    1
  • Views: 

    169
  • Downloads: 

    0
Keywords: 
Abstract: 

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 169

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 1 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Issue Info: 
  • Year: 

    2023
  • Volume: 

    21
  • Issue: 

    2
  • Pages: 

    89-100
Measures: 
  • Citations: 

    0
  • Views: 

    237
  • Downloads: 

    36
Abstract: 

Using the context and order of words in sentence can lead to its better understanding and comprehension. Pre-trained language Models have recently achieved great success in natural language processing. Among these Models, The BERT algorithm has been increasingly popular. This problem has not been investigated in Persian language and considered as a challenge in Persian web domain. In this article, the embedding of Persian words forming a sentence was investigated using the BERT algorithm. In the proposed approach, a Model was trained based on the Persian web dataset, and the final Model was produced with two stages of fine-tuning the Model with different architectures. Finally, the features of the Model were extracted and evaluated in document ranking. The results obtained from this Model are improved compared to results obtained from other investigated Models in terms of accuracy compared to the multilingual BERT Model by at least one percent. Also, applying the fine-tuning process with our proposed structure on other existing Models has resulted in the improvement of the Model and embedding accuracy after each fine-tuning process. This process will improve result in around 5% accuracy of the Persian web ranking.

Yearly Impact: مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 237

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 36 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesCitation 0 مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesRefrence 0
Writer: 

مهدی-جلالی

Issue Info: 
  • End Date: 

    مهر 1384
Measures: 
  • Citations: 

    0
  • Views: 

    251
  • Downloads: 

    0
Keywords: 
Abstract: 

قطعه فوق یک قطعه استراتژیک در صنعت حفاری است که دانش فنی آن را جهاد تهیه کرده است. دانش فنی این قطعه شامل مشخصات مکانیکی و متالورژیکی، نقشه فنی و نقشه بازرسی و همچنین اسکوپ بازرسی است.

Yearly Impact:   مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 251

Issue Info: 
  • Year: 

    2020
  • Volume: 

    6
Measures: 
  • Views: 

    156
  • Downloads: 

    256
Abstract: 

Social media like Twitter have become very popular in recent decades. Hashtags are new kind of metadata which make non-structured tweets into searchable semistructured content. There are varied previous methods which recommend hashtags for new tweets. However, to the best of our knowledge, there is no previous word that uses BERT embedding for this purpose. In this paper, we propose a new method called EmHash that uses neural network based on BERT embedding to recommend new hashtags for each tweet. Unlike other word embeddings, BERT embedding constructs different vectors for the same word in different contexts. Emhash succeeded in outperforming three methods LDA, SVM, and TTM with respect to recall measure.

Yearly Impact:   مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 156

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 256
Issue Info: 
  • Year: 

    2024
  • Volume: 

    10
Measures: 
  • Views: 

    79
  • Downloads: 

    4
Abstract: 

The fake news story refers to information falsified or made up to manipulate or deceive the public. It is a serious issue that undermines the credibility of information sources and influences public opinion, discourse, and important decisions and outcomes. The content and source of news articles and stories must be considered when identifying and classifying fake news. Detecting fake news is an important and active research area with many potential applications and implications for society. Many challenges are involved in determining whether a news article is authentic. Several deep learning Models that employ natural language processing (NLP) have shown excellent results in detecting fake news. To assess the truthfulness of news articles, our methodology is based on state-of-the-art language Models based on transformers. Bidirectional Encoder Representation from Transformers (BERT) and Robustly Optimized BERT Technique (RoBERTa) is one of the most advanced Models. Our findings reveal that the BERT Model achieved an accuracy of 64%, while the RoBERTa Model slightly outperformed it with an accuracy of 66%. These results are particularly significant when compared to similar research in this domain, which reported a maximum accuracy of 62% for both Models on Liar dataset.

Yearly Impact:   مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 79

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 4
Issue Info: 
  • Year: 

    2020
  • Volume: 

    6
Measures: 
  • Views: 

    117
  • Downloads: 

    80
Abstract: 

Community Question-Answering websites, such as StackOverflow and Quora, expect users to follow specific guidelines in order to maintain content quality. These systems mainly rely on community reports for assessing contents, which has serious problems, such as the slow handling of violations, the loss of normal and experienced users' time, the low quality of some reports, and discouraging feedback to new users. Therefore, with the overall goal of providing solutions for automating moderation actions in Q&A websites, we aim to provide a Model to predict 20 quality or subjective aspects of questions in QA websites. To this end, we used data gathered by the CrowdSource team at Google Research in 2019 and fine-tuned pre-trained BERT Model on our problem. Based on our evaluation, Model achieved value of 0. 046 for Mean-Squared-Error (MSE) after 2 epochs of training, which did not improve substantially in the next ones. Results confirm that by simple fine-tuning, we can achieve accurate Models in little time and on less amount of data.

Yearly Impact:   مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View 117

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic ResourcesDownload 80
litScript
telegram sharing button
whatsapp sharing button
linkedin sharing button
twitter sharing button
email sharing button
email sharing button
email sharing button
sharethis sharing button